AITopics | provide feedback

Collaborating Authors

provide feedback

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Benchmarking the Robustness of Agentic Systems to Adversarially-Induced Harms

Nöther, Jonathan, Singla, Adish, Radanovic, Goran

arXiv.org Artificial IntelligenceOct-8-2025

Ensuring the safe use of agentic systems requires a thorough understanding of the range of malicious behaviors these systems may exhibit when under attack. In this paper, we evaluate the robustness of LLM-based agentic systems against attacks that aim to elicit harmful actions from agents. To this end, we propose a novel taxonomy of harms for agentic systems and a novel benchmark, BAD-ACTS, for studying the security of agentic systems with respect to a wide range of harmful actions. BAD-ACTS consists of 4 implementations of agentic systems in distinct application environments, as well as a dataset of 188 high-quality examples of harmful actions. This enables a comprehensive study of the robustness of agentic systems across a wide range of categories of harmful behaviors, available tools, and inter-agent communication structures. Using this benchmark, we analyze the robustness of agentic systems against an attacker that controls one of the agents in the system and aims to manipulate other agents to execute a harmful target action. Our results show that the attack has a high success rate, demonstrating that even a single adversarial agent within the system can have a significant impact on the security. This attack remains effective even when agents use a simple prompting-based defense strategy. However, we additionally propose a more effective defense based on message monitoring. We believe that this benchmark provides a diverse testbed for the security research of agentic systems. The benchmark can be found at github.com/JNoether/BAD-ACTS

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2508.16481

Country:

Europe > Germany > Saarland > Saarbrücken (0.14)
Europe > United Kingdom > England (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Leisure & Entertainment > Games (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area (1.00)
Banking & Finance > Trading (0.92)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

A Potential Negative Societal Impacts

Neural Information Processing SystemsSep-29-2025, 02:09:10 GMT

In addition, users may become overly dependent on the model's outputs For the feedback, we ask the person "Please consider the quality of the Given a score (1-5). 1 means its quality is bad, and 5 means its quality is very good". The interface of the user study is shown in Fig. A1. We report the average scores in Tab. We have a total of 1.1M training data in FIRE. In Fig. A2, we present the curves of A T, A TR, A TR, and RR using different Results show that more data leads to better performance.

artificial intelligence, dataset, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.05)

Genre: Research Report > New Finding (0.34)

Industry:

Education (0.47)
Social Sector (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.67)

Add feedback

Steered Generation via Gradient Descent on Sparse Features

Bhattacharyya, Sumanta, Rooshenas, Pedram

arXiv.org Artificial IntelligenceFeb-25-2025

Large language models (LLMs) encode a diverse range of linguistic features within their latent representations, which can be harnessed to steer their output toward specific target characteristics. In this paper, we modify the internal structure of LLMs by training sparse autoencoders to learn a sparse representation of the query embedding, allowing precise control over the model's attention distribution. We demonstrate that manipulating this sparse representation effectively transforms the output toward different stylistic and cognitive targets. Specifically, in an educational setting, we show that the cognitive complexity of LLM-generated feedback can be systematically adjusted by modifying the encoded query representation at a specific layer. To achieve this, we guide the learned sparse embedding toward the representation of samples from the desired cognitive complexity level, using gradient-based optimization in the latent space.

arxiv preprint arxiv, representation, survey article, (15 more...)

arXiv.org Artificial Intelligence

2502.18644

Country: North America > United States > Illinois (0.14)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems

Zhang, Mike, Dilling, Amalie Pernille, Gondelman, Léon, Lyngdorf, Niels Erik Ruan, Lindsay, Euan D., Bjerva, Johannes

arXiv.org Artificial IntelligenceFeb-18-2025

Providing high-quality feedback is crucial for student success but is constrained by time, cost, and limited data availability. We introduce Synthetic Educational Feedback Loops (SEFL), a novel framework designed to deliver immediate, on-demand feedback at scale without relying on extensive, real-world student data. In SEFL, two large language models (LLMs) operate in teacher--student roles to simulate assignment completion and formative feedback, generating abundant synthetic pairs of student work and corresponding critiques. We then fine-tune smaller, more computationally efficient LLMs on these synthetic pairs, enabling them to replicate key features of high-quality, goal-oriented feedback. Unlike personalized tutoring approaches that offer multi-turn, individualized instruction, SEFL specifically focuses on replicating the teacher-->student feedback loop for diverse assignments. Through both LLM-as-a-judge and human evaluations, we demonstrate that SEFL-tuned models outperform their non-tuned counterparts in feedback quality, clarity, and timeliness. These findings reveal SEFL's potential to transform feedback processes for higher education and beyond, offering an ethical and scalable alternative to conventional manual feedback cycles.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.12927

Country:

North America > Mexico > Mexico City > Mexico City (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia > Singapore (0.04)
(7 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry:

Education > Educational Setting (1.00)
Education > Assessment & Standards (0.88)
Education > Educational Technology > Educational Software > Computer Based Training (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)

Add feedback

Here's how ed-tech companies are pitching AI to teachers

MIT Technology ReviewSep-3-2024, 09:00:00 GMT

But this year, more and more educational technology companies are pitching schools on a different use of AI. Rather than scrambling to tamp down the use of it in the classroom, these companies are coaching teachers how to use AI tools to cut down on time they spend on tasks like grading, providing feedback to students, or planning lessons. One company, called Magic School, says its AI tools like quiz generators and text summarizers are used by 2.5 million educators. Khan Academy offers a digital tutor called Khanmigo, which it bills to teachers as "your free, AI-powered teaching assistant." Teachers can use it to assist students in subjects ranging from coding to humanities.

artificial intelligence, ed-tech company, natural language, (9 more...)

MIT Technology Review

Country:

North America > United States > North Carolina (0.06)
North America > United States > Colorado (0.06)
North America > United States > California (0.06)
(3 more...)

Genre: Instructional Material (0.73)

Industry: Education > Educational Setting (1.00)

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.53)
Information Technology > Artificial Intelligence > Natural Language (0.33)

Add feedback

Continually Improving Extractive QA via Human Feedback

Gao, Ge, Chen, Hung-Ting, Artzi, Yoav, Choi, Eunsol

arXiv.org Artificial IntelligenceNov-3-2023

We study continually improving an extractive question answering (QA) system via human user feedback. We design and deploy an iterative approach, where information-seeking users ask questions, receive model-predicted answers, and provide feedback. We conduct experiments involving thousands of user interactions under diverse setups to broaden the understanding of learning from feedback over time. Our experiments show effective improvement from user feedback of extractive QA models over time across different data regimes, including significant potential for domain adaptation.

machine learning, natural language, question answering, (21 more...)

arXiv.org Artificial Intelligence

2305.12473

Country:

North America > United States > Gulf of Mexico > Central GOM (0.15)
North America > United States > Illinois > Cook County > Chicago (0.05)
Asia > Japan > Kyūshū & Okinawa > Okinawa (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Sports > Baseball (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.35)

Add feedback

Smart tutor to provide feedback in programming courses

Roldán-Álvarez, David

arXiv.org Artificial IntelligenceOct-12-2023

Artificial Intelligence (AI) is becoming more and more popular as time passes, allowing to perform tasks that were difficult to do in the past. From predictions to customization, AI is being used in many areas, not being educational environments outside this situation. AI is being used in educational settings to customize contents or to provide personalized feedback to the students, among others. In this scenario, AI in programming teaching is something that still has to be explored, since in this area we usually find assessment tools that allow grading the students work, but we can not find many tools aimed towards providing feedback to the students in the process of creating their program. In this work we present an AI based intelligent tutor that answers students programming questions. The tool has been tested by university students at the URJC along a whole course. Even if the tool is still in its preliminary phase, it helped the students with their questions, providing accurate answers and examples. The students were able to use the intelligent tutor easily and they thought that it could be a useful tool to use in other courses.

artificial intelligence, programming course, provide feedback, (1 more...)

arXiv.org Artificial Intelligence

2301.09918

Genre: Research Report (0.66)

Industry: Education > Educational Setting (0.53)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

An Ontology of Co-Creative AI Systems

Lin, Zhiyu, Riedl, Mark

arXiv.org Artificial IntelligenceOct-11-2023

The term co-creativity has been used to describe a wide variety of human-AI assemblages in which human and AI are both involved in a creative endeavor. In order to assist with disambiguating research efforts, we present an ontology of co-creative systems, focusing on how responsibilities are divided between human and AI system and the information exchanged between them. We extend Lubart's original ontology of creativity support tools with three new categories emphasizing artificial intelligence: computer-as-subcontractor, computer-as-critic, and computer-as-teammate, some of which have sub-categorizations.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2310.07472

Genre: Research Report (0.42)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.84)
Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.58)

Add feedback

How sure is sure? Incorporating human error into machine learning

AIHubSep-4-2023, 11:23:26 GMT

Human error and uncertainty are concepts that many artificial intelligence systems fail to grasp, particularly in systems where a human provides feedback to a machine learning model. Many of these systems are programmed to assume that humans are always certain and correct, but real-world decision-making includes occasional mistakes and uncertainty. Researchers from the University of Cambridge, along with The Alan Turing Institute, Princeton, and Google DeepMind, have been attempting to bridge the gap between human behaviour and machine learning, so that uncertainty can be more fully accounted for in AI applications where humans and machines are working together. This could help reduce risk and improve trust and reliability of these applications, especially where safety is critical, such as medical diagnosis. The team adapted a well-known image classification dataset so that humans could provide feedback and indicate their level of uncertainty when labelling a particular image.

artificial intelligence, machine learning, simulation of human behavior, (15 more...)

AIHub

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.25)
North America > Canada > Quebec > Montreal (0.05)

Industry: Health & Medicine (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)
Information Technology > Artificial Intelligence > Cognitive Science > Simulation of Human Behavior (0.41)

Add feedback